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ABSTRACT 

We construct a galaxy groups catalogue from the public lOOK data release of the 
2dF galaxy redshift survey. The group identification is carried out using a slightly 
modified version of the group finding algorithm developed by Huchra & Geller. Several 
tests using mock catalogues allow us to find the optimal conditions to increase the 
reliability of the final group sample. A minimum number of 4 members, an outer 
number density enhancement of 80 and a linking radial cutoff of 200 km sec~^, are 
the best obtained values from the analysis. Using these parameters, approximately 
90% of groups identified in real space have a redshift space counterpart. On the other 
hand the level of contamination in redshift space reaches to 30 % including a ^ 6% 
of artificial groups and ~ 24% of groups associated with binaries or triplets in real 
space. The final sample comprise 2209 galaxy groups covering the sky region described 
by CoUess et al. spanning over the redshift range of 0.003 < z < 0.25 with a mean 
redshift of O.f . 
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1 INTRODUCTION 

The study of galaxy groups is a very interesting area of re- 
search because these density fluctuations lay between galax- 
ies and clusters of galaxies and may provide important clues 
to galaxy formation. 

Since the appearing of large galaxy redshift surveys 
more reliable detection of group of galaxies have been possi- 
ble. Two pioneer works develop the main identification algo- 
rithms. The first was introduced by Huchra & Geller (1992) 
and the second was proposed by Nolthenius & White (1987); 
both are friends-of-friends algorithms, differing only in the 
scaling of the linking lengths. Frederic (1995) perform an ex- 
tensive study of these algorithms using N-body simulations, 
concluding that neither of them is intrinsically superior to 
the other, and the choice of one of them depends on the 
purpose for which groups are to be studied. Recently sev- 
eral catalogues have been constructed from different large 
galaxy redshift surveys using these algorithms. Merchan et 
al. (2000) use the Updated Zwicky Catalogue (Falco et al. 
1999) generating a sample of 517 galaxy groups. Using the 
Nearby Optical Galaxy Sample, Giuricin et al. (2000) con- 
struct one of the large catalogue of loose groups and Tucker 
et al. (2000) found 1495 groups in the Las Campanas Red- 
shift Survey. Finally, Ramella et al. (2002) generate a group 
catalogue of 1168 members from the combination of the Up- 
dated Zwicky Catalogue and the Southern Sky Redshift Sur- 
vey (da Costa et al. 1998). 

At the present, the largest sample of galaxies with spec- 



troscopic redshifts (~ 100000) is the Anglo-Australian two 
degree field galaxy redshift survey (hereafter 2dFGRS, Col- 
less et al. 2001). The survey covers 2000 deg^ and has a 
median depth z = 0.11. 

Several studies were done with this preliminary sample 
by the 2dFGRS Team. Some of then are the power spectrum 
of galaxies (Percival et al. 2001), the projected two point 
correlation function of galaxies for different volume limited 
samples (Norberg et al. 2001a), and the estimation of the bj 
band galaxy luminosity function (Norberg et al. 2001b). An- 
other study related with system of galaxies in the 2dFGRS 
has been carried out by De Propris et al. (2002) where they 
present an study of 3-dimensional galaxy clusters based on 
well known bidimensional galaxy clusters. 

The large number of galaxies and the depth of the 2dF- 
GRS make it very suitable for galaxy groups identification. 
Nevertheless the present release of the sample is not uniform 
due to variations in sky coverage which make difiicult the 
analysis of the data. 

The main objective of this work is the identification 
of groups in the 2dFGRS and estimation of their physical 
properties (number of members, velocity dispersion, virial 
radius and mass). In order to take into account the vari- 
ation in the sky coverage of the 2dFGRS we introduce a 
couple of modifications to the group-finding algorithm de- 
veloped by Huchra & Geller (1982). The first one deals with 
the apparent magnitude limit variation across the sky and 
the second is related to redshift completeness. These modi- 
fications are tested using mock catalogues which reproduce 
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Figure 1. AitofT projections of the 2dFGRS. The upper projection represent the distribution of galaxies in equatorial coordinates in the 
northern strip and the lower the southern one. 



each effect separately. We also use these mock catalogues to 
explore the space of linking parameters to maximize group 
identification accuracy. 

The paper is structured as follows. The catalogue is de- 
scribed in section 2. In section 3 we describe the group finder 
algorithm used to identify galaxy groups in the 2dFGRS. A 
detailed test of the method is performed in section 4. Finally, 
the 2df group catalogue is presented in section 5 followed by 
conclusions in section 6 . 



2 THE GALAXY CATALOGUE 

The complete 2dFGRS will consist of approximately 250000 
galaxies with redshifts in two declination strips plus 100 
random 2-degree fields. All targets are selected in the pho- 
tometric bj band from the APM galaxy survey (Maddox 
et al. 1990a,b;1996). The southern strip (SGP) covers ap- 
proximately 1275 square degrees (—37°. 5 < 5 < —22°. 5; 
21'' 40"" < a < 3''30'") and the northern strip (NGP) cov- 
ers 750 square degrees (-7°. 5 < S < 2°. 5; 9''50'" < a < 
14'' 50'") while the 100 random fields are spread uniformly 
over the 7000 square degrees in the southern region. 

In our analysis we use the 2dF public lOOK data release 
of galaxies with the best redshift estimates within the two 
main strips (NGP and SGP) of the catalogue. Given the 



status of the project, the sky coverage of the sample is not 
uniform, so a detailed completeness description is needed 
(see Figure |l]). For this purpose, we use in our analysis the 
2dFGRS mask software constructed by Peder Norberg and 
Shaun Cole which take into account both the magnitude 
limit and the redshift completeness (see also Figure 13 and 
15 of CoUess et al. 2001). The redshift completeness was 
defined with the 2° field used to tile the survey region for 
spectroscopic observations. This quantity is the ratio of the 
number of galaxies for which redshifts have been obtained 
to the total number of objects contained in the parent cata- 
logue and has a mean value of ~ 0.75 for the whole sample. 
On the other hand, the magnitude limit mask corresponds 
to variations of the parent survey magnitude limit with the 
position on the sky. This variation span over a magnitude 
limit range of m = 18.95 — 19.55 for the NGP and a range of 
m = 19.3-19.55 for the SGP. This sample comprise 102426 
galaxies with final bj magnitudes corrected for galactic ex- 
tinction. 



3 THE GROUP-FINDING ALGORITHM 

For group identification we use a friends-of-friends algorithm 
similar to that described by Huchra & Geller (1982) modi- 
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fied in order to take into account redshift completeness and 
magnitude limit variations. 

If we have a pair of galaxies with mean radial velocity 
V — {Vi + V2)/2 and angular separation 612, our algorithm 
links galaxies when the following conditions are satisfied: 



D12 
and 



2 sin 
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< Dr 



Vu = \Vi - V2\ < Vl 



(1) 



(2) 



where D12 is the angular separation and V12 is the line-of- 
sight velocity difference. The transverse Dl and radial link- 
ing lengths Vl are scaled as Do R and Vo R respectively in 
order to compensate for the decline of the selection function 
with distance. The scaled factor is computed as 



R = 



-, -1/3 



(3) 



where Mum and M12 are the absolute magnitude of the 
brightest galaxy visible at a distance Vf/Ho and V/Ho re- 
spectively. In these equations, Do and Vo are the linking 
cutoffs at the fiducial velocity Vf, and 4>{M) is the galaxy 
luminosity function of the sample. 

In the special case of the 2dFGRS, we should take into 
account the magnitude limit and completeness variations 
through the sky. The adopted way to solve this issue consists 
in defining an average magnitude limit for each pair 



mum = {mum + mum)/2 

and redefining the scale factor R as 



R = 



I"l'm)dM ^c,+c2) 



-, -1/3 



(4) 



(5) 



where Ci and C2 are the corresponding completeness values 
for each galaxy position on the sky. 



4 TESTING THE METHOD 
4.1 Mock Catalogues 

To examine the degree of accuracy of our algorithm we use 
a set of mocks catalogues constructed from a gravitational 
numerical simulation of a flat low density cold dark mat- 
ter universe. We perform this simulation using the Hydra 
Nbody code developed by Couchman et. al (1995) with 128^ 
particles in a cubic comoving volume of 180 Mpc per 
side starting at z=50. The adopted cosmological model was a 
universe with Qm = 0.3, Qa ~ 0.7, Hubble constant h = 0.7 
and a relative mass fluctuation of as = 0.9. 

In order to reproduce the same radial distribution as in 
the 2dFGRS, we adopt a galaxy luminosity function fitted 
by a Schechter function with M^. — blogioh = —19.66, a = 
-1.21, $* = 1.68x10 
k-f e correction given by the formula 



^h^Mpc and a model of the average 



k{z) + e{z) 



1 + 20^3 



(6) 




Figure 2. The percentage of groups found in redshift space which 
match with some group in real space. These percentage are shown 
spanning the parameter space Vq (100, 200, 300, 400 and 500 
fcm-l) 5p/p (20-filled circles, 40 -open circles, 80-filled triangles 
and 160-open squares). Each panel show the different kind of 
mocks constructed. 
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Figure 3. The percentage of the group catalogue in real space 
associated with the catalogue of matched groups in redshift space. 
The symbols and the panels are the same as in Figure bl 



(see Norberg et al. 2001b). Combining these models and 
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using the positions from the simulation we generate an ap- 
parent magnitude 

m = k + e + 5logio{dL/h~^Mpc) + 25 + {Mbj - Slog^, /i)(7) 

for each particle inside the angular mask described in sec- 
tion 2. We construct four mock catalogues with different sky 
coverage: 

• in the first mock we introduce a fixed faint survey mag- 
nitude limit (m); 

• the second mock has a fixed faint survey magnitude 
limit and the effect of redshift completeness as in the real 
survey (c — m); 

• the third mock has a faint survey magnitude limit which 
changes with the angular position of a particle in the same 
way as the magnitude limit mask of the 2dFGRS (m„); 

• finally, the fourth mock have both effects, the magni- 
tude limit mask and the redshift completeness mask (c— m„). 

4.2 Identification Accuracy 

From the mock catalogues described in the previous section, 
we identify groups in real and redshift spaces. The former 
identification is carried out using the same linking length in 
both directions (Vb = Do) whereas different linking lengths 
are used in redshift space. The fiducial velocity V/ is adopted 
as 1000 km in all identifications. The linking lengths are 
selected spanning the space parameter (Vb, Sp/p) where the 
later is the number density contour surrounding a group and 
corresponds to a fixed number density enhancement relative 
to the mean number density of 

7 ^sij (£""*'"""') 

The selected values are: 

• Vo : 100, 200, 300, 400 and 500 km s'^ 

• Sp/p : 20, 40, 80 and 160 

To quantify how good are the groups identified in red- 
shift space we implement a method to match the center of 
mass of these groups with the center of mass of groups in 
real space. Our method seeks for groups in real space within 
a projected and a radial distance having its origin in the cen- 
ter of mass of each group in redshift space. We choose the 
searching parameters in order to maximize the matching, 
nevertheless the results are very stable around the chosen 
values. The results show that almost all groups have one 
match inside the limits and only a few are related with two 
or three groups in real space. In the later situation we choose 
to associate the group in redshift space with that group in 
real space with more shared particles. 

The first result from this comparison is that the per- 
centage of spurious groups identified is remarkably higher 
when we include groups in redshift space with 3 members 
(~ 40%). RameUa, Geller & Huchra (1989) and Frederic 
(1995) obtain that one-third or more of the groups identified 
in redshift space with N=3 are spurious, which is roughly 
consistent with our result. Consequently, we adopt as the 
main catalogue all groups identified in redshift space with 
more than 4 members. Figure ^ shows the number of groups 
which match with one group in real space relative to the 
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Figure 4. Fraction of groups of a given richness Ntot with J^. 
Cross hatched regions show a ratio of unity, single narrow 
hatched regions correspond to 0.75 < .7-" < 1, 0.5 < ^ < 0.75 
to single wide hatched regions and no hatching regions belong to 
0.25 < ^ < 0.5. For richness 10 we include all groups with 10 
or more members. The labels over the bars show the number of 
groups for the corresponding richness. 



total number of groups in the main catalogue. The anal- 
ysis comprises all mock and identification parameters un- 
der consideration. The complementary percentage in each 
case represents the spurious groups in the main catalogue. 
We consider spurious those groups which do not have any 
match or are associated with a binary or a triplet in real 
space. The individual percentages for these spurious groups 
for the c — m« mock are shown in Table 1 where the first 
line represents no matched groups, the second and third lines 
correspond to binaries and triplets respectively. 

We also analise the percentage of the group catalogue 
in real space associated with matched groups in the main 
catalogue. That percentage is plotted in Figure |^ for all the 
options as in Figure ^ Despite the percentage of matched 
groups in the main catalogue seems to be the highest for 
a velocity of 100 km and density contrast of 160, it 
can be seen from Figure ^ that this choice represents one of 
the lowest percentages of the group catalogue in real space. 
Combining the two figures we can conclude that the best 
options are a velocity of 200 km and density contrast of 
80 or 160. We choose a density contrast of 80 because the 
total number of groups identified is greater and the matching 
accuracy is roughly similar. 

The resulting value of Jp/p = 80 is in agreement with 
that obtained by Ramella et al. (1997) for the CfA2 Redshift 
Survey, whereas our optimal velocity linking parameter Vo — 
200 km differs from his choice (Vb ~ 350 km s~^). 
Should be taken into account that equal Sp/p correspond to 
different values of Do depending of the apparent magnitude 
limit of the survey. Particularly, the the apparent magnitude 
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-2.5"<6< 2.5° -29.0°<6<-22.5" 



Figure 5. The redshift distribution for the galaxy groups identified in the 2dFGRS. The pieplot show at the left the NGP of the 2dFGRS 
and the SGP at right. The dots are the galaxies and the open circles the galaxy groups. To maJce the plot clearer we limit the plot to 
z < 0.15 
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Table 1. Percentage of spurious groups identified in the c — 
mock 
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Figure 6. Upper and lower panels show the distribution of the 
main physical properties of groups at low and high completeness 
regions respectively. Solid and doted lines correspond to groups 
with and without border effects. 



limits for the 2dFGRS and the CfA2 Redshift Survey are ~ 
19.4 and 15.5 respectively. Consequently, the higher number 
density of the 2dFGRS implies smaller linking parameters 
than those obtained for the CfA2 Redshift Survey. 

From the comparison between the four kind of mock 
catalogues we conclude that implemented modifications keep 
the behavior of the original algorithm, showing the ability of 



these modifications to deal with an inhomogeneous galaxy 
samples like the 2dFGRS. 

To examine the degree of agreement between members 
of groups in real and redshift space we perform an analysis 
similar to Frederic (1995) for matched groups in the main 
catalogue. If we define the total number of members of a 
given group in redshift space as Ntot, we search how many 
members of this group are in the associated group in real 
space Nm and compute the ratio = Nm/Ntot- Figure ^ 
shows the fraction of groups of a given richness Ntot with T 
for several values: 1 (cross hatched regions), 0.75 < J- < 1 
(single narrow hatched regions), 0.5 < J- < 0.75 (single wide 
hatched regions) and 0.25 < J- < 0.5 (no hatching regions). 
For richness 10 we include all groups with 10 or more mem- 
bers. As it can be seen, more than ~65% of the groups have 
a ratio T greater than 0.75 for all kind of mocks. It also 
should be noticed that the implemented modifications keep 
the performance of the original Huchra & Geller algorithm. 

We still need to test that our modifications to the 
Huchra & Geller algorithm improve the detection of galaxy 
groups. In order to prove the modification which take into 
account the redshift completeness we compare the results of 
the identification using both, modified and original Huchra 
& Geller algorithms on the c — m mock catalogue. As result, 
we find that the identification using our modification pro- 
duce 263 (~ 11% of the total sample) more groups than the 
corresponding to the plain algorithm version. On the other 
hand, the wide apparent magnitude limit range implies a 
variation in the local number density, which produce a simi- 
lar effects as the redshift completeness. This effect was tested 
applying a similar procedure as the former but using the m„ 
mock catalogue. The results show a difference of 89 groups 
using our modified algorithm against the original Huchra & 
Geller one. The effect due to the apparent magnitude limit 
is smaller than the observed for the redshift completeness 
because the magnitude limit distribution have a mean value 
of ~ 19.4 tending to the upper limit of the whole range. Fi- 
nally, we test the joint effects applying the same procedure 
to the c — rriv mock catalogue, showing that our modifica- 
tion identify 280 more groups (~ 14% of the total sample). 
As we have shown in the previous tests, the modified algo- 
rithm keep the performance of the original Huchra & Geller 
method, then the ~ 70% of these differences correspond to 
real groups for the chosen linking parameters. 



5 GALAXY GROUP IDENTIFICATION IN 
THE 2DFGRS 

To identify groups in the 2dF survey (2dFGGC), we adopt 
the values Sp/p — 80 and Vo ~ 200 km which maximize 
the group accuracy as shown in section 4. 

The resulting groups catalogue contains systems with 
at least 4 members, mean radial velocities in the range 
gOOfcms"^ <V< 75000fc7ns"^ and a total number of 2209 
groups. It represents the largest sample to the present and 
provides a suitable data set to analise the clustering prop- 
erties of galaxy systems of low richness. As discussed in the 
previous section, the limit adopted in the number of mem- 
bers in galaxy groups is necessary in order to avoid pseudo- 
groups. 
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Table 2. Mean parameters of the 2dFGGC 
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Figure 7. The upper-left panel shows the histogram describing 
the radial distribution for the groups identified in the 2dFGRS. 
The upper-right panel shows the distribution of groups velocity 
dispersions. The lower-left panel plots the virial mass distribution 
whereas the lower-right panel shows the distribution of groups 
virial radii. 



Figure |5| shows the cone diagrams for galaxies (points) 
and groups (open circles) in the 2dF survey. 

Estimation of the virial mass for galaxy groups is carried 
out using the following equation 



G 



(9) 



where Ry is the virial radius of the system and a is the 
velocity dispersion of member galaxies (Limber & Mathews 
1960). The virial radius is estimated as 



R\ 



Rpv = 



1) 



(10) 

(11) 



where Rpv is the projected virial radius, Ng is the number 
of galaxies members and Rij the galaxy projected distances. 
The velocity dispersion a is estimated using their observa- 
tional counterpart, the line-of-sight velocity dispersion a^, 
a = y/3av In particular we use the methods described by 
Beers, Flynn & Gebhardt (1990) to obtain a robust estima- 
tion of the radial velocity dispersion. We apply the biweight 
estimator for groups with richness Ntot > 15 and the gapper 
estimator for poorer groups (Girardi et al. 1993, Girardi & 
Giuricin 2000). Despite these methods improve the veloc- 
ity dispersion estimation in terms of efficiency and stability 
when dealing with small groups, it should be taken into ac- 
count that small errors in the estimate of the velocity disper- 
sion results in correspondingly larger errors in the derived 
mass. 

The group properties estimation could be biased by the 
geometrical and sampling effects in the 2dF catalog. To deal 
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with border effects we create a simple magnitude limited 
mock catalogue where limits in declination and right as- 
cension exceed the corresponding to the 2dF catalog. Con- 
sequently we identify groups in this mock catalogue keep- 
ing those inside the 2dF border and we confront them with 
groups identified in a mock catalogue which emulate the 2dF 
geometry. As result, we observe no differences between the 
distribution of physical properties {Rv,cf and M) of these 
samples, showing that the complex geometry of the 2dF bor- 
der do not affect group properties. 

Another source of bias could be the variation of redshift 
completeness across the sky producing a wrong estimation 
of the physical properties in low completeness regions. In or- 
der to quantify this possible bias, we compare the physical 
properties of both, c — m and m mock catalogues described 
in section 4, splitting these samples in high and low redshift 
completeness. In Figure |^ we show the histogram of Rv, cr 
and M for low (< 0.6, upper panels) and high (> 0.6,lower 
panels) completeness. Dotted lines correspond to m mock 
catalogue whereas the solid lines correspond to c— m one. As 
it can be seen, only the virial radius in the low redshift com- 
pleteness regions show some differences, nevertheless they 
do not produce any significant effect in the mass estimation. 

Finally the distribution of the estimated physical prop- 
erties are shown in Figure ^. In the upper-left panel we plot 
the redshift distribution with a mean redshift of 2 ~ 0.1, 
the lower-left panel shows the virial mass distribution, the 
upper-right panel correspond to the velocity dispersion dis- 
tribution, and the virial radii distribution is shown in the 
remainder plot. The mean values of the 2dFGGC are quoted 
in Table 1. 



6 CONCLUSIONS 

In this paper we present a new catalogue of galaxy groups 
derived from the 2dFGRS, the largest sample of galaxies 
with spectroscopic redshifts at the present. The construc- 
tion of this catalogue of groups (2dFGGC) was carried out 
introducing some modifications in the friend of friends al- 
gorithm developed by Huchra & Geller (1982) in order to 
take into account the different sky coverage variations in 
the sample. We have tested these modifications taking ad- 
vantage of mock catalogues which allow us to explore the 
space parameter looking for the best values to maximize the 
group identification accuracy in redshift space. With these 
parameters {5p/ p = 80 and Vb = 200 km sec~^) we obtain 
that ~ 90% of groups in real space match some group with 
at least four members in redshift space. Furthermore, the 
~ 30% of groups in redshift space are spurious (no matched 
groups ~ 6% and groups associated with a binary or triplet 
~ 24%). We also find that the introduced modifications are 
able to detect ~ 14% more groups than the obtained by 
the original Huchra & Geller algorithm. Additionally, we 
explore the border effect and a possible bias due to redshift 
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completeness on the physical properties, finding that none 
of them produce significant effects on the final estimations. 

Using the optimal identification parameters, the final 
group sample for the 2dF survey comprise 2209 groups of 
galaxies with mean redshift of z ~ 0.1 and a mean velocity 
dispersion of a=261 km/s. From the estimation of the virial 
masses for the galaxy groups we obtain a mean mass M = 
8.5 X 10^^ h-^ Mq with a mean virial radius Rv = 1.12 
h-^ Mpc. 

The new galaxy group catalogue is one of the largest 
groups samples until the present. For this reasons the new 
sample is more suitable for statistical studies than has been 
previously available. 
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